Translated by Google Translate
End-to-end speech-to-speech translation (S2ST) is generally evaluated with text-based metrics. This means that generated speech has to be automatically transcribed, which makes the evaluation dependent on the availability and quality of automatic speech recognition (ASR) systems. In this paper, we propose BLASER, a text-free evaluation metric for end-to-end S2ST that avoids the dependency on ASR systems. BLASER leverages a multilingual multimodal encoder to directly encode the speech segments for the source input, the translation output, and the reference into a shared embedding space, and computes a translation-quality score that can be used as a proxy for human evaluation. To evaluate our approach, we construct training and evaluation sets from more than 40k human annotations covering seven language directions. The best results of BLASER are achieved by training with supervision from human rating scores. We show that, when evaluated at the sentence level, BLASER correlates significantly better with human judgment than ASR-dependent metrics, including ASR-SENTBLEU in all translation directions and ASR-COMET in five of them. Our analysis shows that combining speech and text as inputs to BLASER does not increase the correlation with human scores, and that the best correlations are achieved when using speech, which motivates the goal of our research. Moreover, we show that using ASR for references is detrimental for text-based metrics.
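As a rough illustration of scoring in a shared embedding space, the sketch below compares a translation embedding against the source and reference embeddings with cosine similarity. The abstract does not give BLASER's formula, so the sum of two cosines, the function names `cosine` and `similarity_score`, and the use of plain Python lists as embeddings are all assumptions made for this example. A real system would obtain the vectors from a speech encoder, which is not shown here.

```python
import math
from typing import Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("a zero vector has no direction")
    return dot / (norm_u * norm_v)


def similarity_score(
    src: Sequence[float], mt: Sequence[float], ref: Sequence[float]
) -> float:
    """Hypothetical text-free score for a translation embedding `mt`.

    Assumption: the score is the sum of the source-translation and
    reference-translation cosines. The abstract does not specify this.
    """
    return cosine(src, mt) + cosine(ref, mt)
```

Under this assumed formula the score lies in [-2, 2], and a translation whose embedding points in the same direction as both the source and the reference receives the maximum.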
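We present a solver for mixed-integer programs (MIPs) that we developed for the MIP Competition 2022. Given the 10-minute limit on computation time set by the competition rules, our method focuses on finding a feasible solution and then improving it with a branch-and-bound algorithm. Another rule of the competition allows the use of up to 8 threads. Each thread is given a different primal heuristic, tuned through its hyperparameters, to find a feasible solution. In every thread, once a feasible solution is found, we stop the heuristic and use a branch-and-bound method with embedded local search heuristics to improve the incumbent solution. The three variants of the diving heuristic that we implemented manage to find a feasible solution for 10 instances of the training data set. These are the best-performing heuristics among those we implemented. Our branch-and-bound algorithm is effective on a small portion of the training data set, and it manages to find a feasible solution for one instance that we could not solve with the diving heuristics. Overall, when run with extensive computational power, our combined methods can solve 11 of the 19 problems in the training data set within the time limit. Our submission to the MIP Competition was awarded the "Outstanding Student Submission" honorable mention.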
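The two-phase idea (obtain a feasible solution with a primal heuristic, then improve the incumbent with branch-and-bound) can be sketched on a toy problem. The example below uses a 0/1 knapsack as a stand-in for a general MIP, a greedy rule in place of the diving heuristics, and a fractional relaxation as the bound. None of this comes from the authors' solver, and the function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def ratio_order(values: Sequence[int], weights: Sequence[int]) -> List[int]:
    """Item indices sorted by value/weight, best first (weights must be > 0)."""
    return sorted(range(len(values)), key=lambda i: values[i] / weights[i], reverse=True)


def greedy_incumbent(values, weights, capacity) -> Tuple[int, List[int]]:
    """Primal heuristic: take items in ratio order while they still fit."""
    total, remaining, chosen = 0, capacity, []
    for i in ratio_order(values, weights):
        if weights[i] <= remaining:
            chosen.append(i)
            total += values[i]
            remaining -= weights[i]
    return total, sorted(chosen)


def fractional_bound(values, weights, order, pos, remaining) -> float:
    """Upper bound on the value still reachable from items order[pos:]."""
    bound = 0.0
    for i in order[pos:]:
        if weights[i] <= remaining:
            bound += values[i]
            remaining -= weights[i]
        else:
            bound += values[i] * remaining / weights[i]  # fractional item
            break
    return bound


def branch_and_bound(values, weights, capacity) -> Tuple[int, List[int]]:
    """Improve the greedy incumbent by depth-first branch-and-bound."""
    order = ratio_order(values, weights)
    best_value, best_items = greedy_incumbent(values, weights, capacity)

    def explore(pos: int, value: int, remaining: int, chosen: List[int]) -> None:
        nonlocal best_value, best_items
        if value > best_value:  # found a better incumbent
            best_value, best_items = value, sorted(chosen)
        if pos == len(order):
            return
        if value + fractional_bound(values, weights, order, pos, remaining) <= best_value:
            return  # prune: this subtree cannot beat the incumbent
        i = order[pos]
        if weights[i] <= remaining:  # branch 1: take item i
            chosen.append(i)
            explore(pos + 1, value + values[i], remaining - weights[i], chosen)
            chosen.pop()
        explore(pos + 1, value, remaining, chosen)  # branch 2: skip item i

    explore(0, 0, capacity, [])
    return best_value, best_items
```

On the instance with values `[60, 100, 120]`, weights `[10, 20, 30]`, and capacity `50`, the greedy heuristic returns a feasible solution of value 160, and branch-and-bound improves it to the optimal value 220. Starting from an incumbent lets the search prune subtrees early, which is the motivation for running a heuristic first under a tight time limit.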